[Kernel] Add punica dimension for Qwen1.5-32B LoRA #4850

Silencioo · 2024-05-16T04:42:04Z

This pr adds support for the Qwen1.5-32B model with LoRA.

To enable Qwen1.5-32B to support LoRA, it should incorporate an additional size into Punica.

Co-authored-by: Silencio <[email protected]>

Add punica dimension 27648

2dce7c4

simon-mo approved these changes May 16, 2024

View reviewed changes

WoosukKwon merged commit 8435b20 into vllm-project:main May 16, 2024

robertgshaw2-redhat pushed a commit to neuralmagic/nm-vllm that referenced this pull request May 19, 2024

[Kernel] Add punica dimension for Qwen1.5-32B LoRA (vllm-project#4850)

1a745a3

Co-authored-by: Silencio <[email protected]>

dtrifiro pushed a commit to dtrifiro/vllm that referenced this pull request May 21, 2024

[Kernel] Add punica dimension for Qwen1.5-32B LoRA (vllm-project#4850)

b551e55

Co-authored-by: Silencio <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Kernel] Add punica dimension for Qwen1.5-32B LoRA #4850

[Kernel] Add punica dimension for Qwen1.5-32B LoRA #4850

Uh oh!

Silencioo commented May 16, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

[Kernel] Add punica dimension for Qwen1.5-32B LoRA #4850

[Kernel] Add punica dimension for Qwen1.5-32B LoRA #4850

Uh oh!

Conversation

Silencioo commented May 16, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants